Analyzing LOFAR station processing on multi-core platforms

Authors

  • Rik Jongerius
  • Henk Corporaal
  • P. Chris Broekema
  • Ton Engbersen
Abstract

The LOw-Frequency ARray (LOFAR) is a phased-array radio telescope in the Netherlands. The astronomical community is currently embarking on the design of a new telescope, the Square Kilometer Array (SKA). As many of the LOFAR techniques form the basis for several SKA modes of operation, a retrospective analysis of LOFAR that reviews its design in light of recent advances in technology is of interest. This work is part of such a retrospective analysis and evaluates the digital part of the station processing on a multi-core platform. An implementation is executed on a quad-core Intel Core i7-3820, using its SSE and AVX instructions. Results show that there is no obviously optimal memory data layout or parallelization strategy, as each choice improves the performance of some algorithms and degrades that of others. Using AVX instructions and Hyper-Threading yields the best performance. A roofline analysis is performed, comparing the Virtex-4 implementation currently deployed in the field with the Intel Core i7 implementation. Further improvements that would let the multi-core implementation reach the performance of the Virtex-4 platform include using more physical processors or accelerators such as GPUs. A comparison with Virtex-7 x690T FPGAs is also made and, based on the roofline model, it is estimated that three state-of-the-art Virtex-7 FPGAs would be required. The results support the choice of FPGAs in the current LOFAR design.
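
The roofline bound that such an analysis relies on can be sketched as follows. This is a minimal illustration of the general roofline model, not the authors' code: the function names are hypothetical, and every number in the usage lines is a made-up placeholder rather than a measurement of the Core i7-3820 or of any Virtex device.

```python
import math


def attainable_gflops(peak_gflops, mem_bandwidth_gbs, operational_intensity):
    """Roofline bound in GFLOP/s.

    peak_gflops           -- compute roof of the platform (GFLOP/s)
    mem_bandwidth_gbs     -- sustained memory bandwidth (GB/s)
    operational_intensity -- FLOPs performed per byte moved to/from memory
    """
    if min(peak_gflops, mem_bandwidth_gbs, operational_intensity) <= 0:
        raise ValueError("all arguments must be positive")
    # A kernel is limited either by compute or by memory traffic,
    # whichever roof is lower at its operational intensity.
    return min(peak_gflops, mem_bandwidth_gbs * operational_intensity)


def ridge_point(peak_gflops, mem_bandwidth_gbs):
    """Operational intensity at which a kernel stops being memory-bound."""
    return peak_gflops / mem_bandwidth_gbs


def devices_needed(required_gflops, peak_gflops, mem_bandwidth_gbs,
                   operational_intensity):
    """Rough device count to sustain required_gflops, assuming ideal scaling."""
    per_device = attainable_gflops(peak_gflops, mem_bandwidth_gbs,
                                   operational_intensity)
    return math.ceil(required_gflops / per_device)


if __name__ == "__main__":
    # Placeholder platform: 100 GFLOP/s peak, 50 GB/s memory bandwidth.
    print(attainable_gflops(100.0, 50.0, 0.5))   # memory-bound kernel
    print(attainable_gflops(100.0, 50.0, 4.0))   # compute-bound kernel
    print(ridge_point(100.0, 50.0))
    print(devices_needed(90.0, 100.0, 50.0, 0.5))
```

Estimating a device count this way assumes the workload scales perfectly across devices, so it gives a lower bound on the hardware needed rather than a prediction.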

Similar resources

Efficient parallelization of the genetic algorithm solution of traveling salesman problem on multi-core and many-core systems

Efficient parallelization of genetic algorithms (GAs) on state-of-the-art multi-threading or many-threading platforms is a challenge because of the difficulty of scheduling hardware resources with respect to thread concurrency. In this paper, to resolve this problem, a novel method is proposed, which parallelizes the GA by designing three concurrent kernels, each of which running some depe...

LOFAR: Detecting Cosmic Rays with a Radio Telescope

LOFAR (the Low Frequency Array), a distributed digital radio telescope with stations in the Netherlands, Germany, France, Sweden, and the United Kingdom, is designed to enable full-sky monitoring of transient radio sources. These capabilities are ideal for the detection of broadband radio pulses generated in cosmic ray air showers. The core of LOFAR consists of 24 stations within 4 square kilom...

A Design Framework for Mapping Dataflow Graphs onto Heterogeneous Multiprocessor Platforms

Dataflow models are valuable tools for representing, analyzing, and synthesizing embedded systems. Heterogeneous computing platforms with multi-core CPUs and Graphics Processing Units (GPUs) provide a low-cost platform for high-performance computations. In this report, we present a dataflow-based automated design framework that incorporates analysis, optimization and synthesis tools for embedded...

Antenna design and distribution for a LOFAR Super Station in Nançay

The Nançay radio astronomy observatory and associated laboratories are developing the concept of a “Super Station” for extending the LOFAR station now installed and operational in Nançay. The LOFAR Super Station (LSS) will increase the number of high sensitivity long baselines, provide short baselines and an alternate core, and be a large standalone instrument. It will operate in the low freque...

Analyzing the Effect of Different Programming Models Upon Performance and Memory Usage on Cray XT5 Platforms

Harnessing the power of multicore platforms is challenging due to the additional levels of parallelism present. In this paper, we examine the effect of the choice of programming model upon performance and overall memory usage on the Cray XT5. We use detailed time breakdowns to measure the contributions to the total runtime from computation, communication, and OpenMP regions of the applications,...

Publication date: 2012